FINITE FIELD MODELS IN ARITHMETIC COMBINATORICS 



BEN GREEN 

Abstract. The study of many problems in additive combinatorics, such as Sze- 
meredi's theorem on arithmetic progressions, is made easier by first studying models 
for the problem in F™, for some fixed small prime p. We give a number of examples 
of finite field models of this type, which allows us to introduce some of the central 
ideas in additive combinatorics relatively cleanly. We also give an indication of how 
the intuition gained from the study of finite field models can be helpful for addressing 
the original questions. 



1. Introduction 

This article is concerned with a variety of problems in additive and combinatorial num- 
ber theory. The following two examples will convey the general flavour: 

Problem 1.1 (3-term APs). What is r^(N), the cardinality of the largest subset of 
{1, . . . , iV} containing no three distinct elements x, x + d, x + 2c? in arithmetic progres- 
sion? 

Problem 1.2 (Sets with small doubling). If A C Z, write A + A for the set of all sums 
a + a' , a, a' e A. What can be said about the structure of A if A is nearly closed under 
addition in the sense that \A + A\ ^ A^|/l|? 

What, then, is the "general flavour"? Of course, both of these problems are of an 
additive combinatorial flavour. Furthermore, they may both be asked in a general 
abelian group. Regarding Problem II .![ we may define the quantity r$(G) for any finite 
abelian group G. And Problem II .21 makes sense in any abelian group. 

The ability to generalise to an arbitrary G will be a common feature of many of the 
questions we discuss. An important observation is that not all abelian groups were 
created equal. It turns out that both Problems 11.11 and II .21 are both considerably easier 
in groups other than those in which they were originally asked (Z/iVZ for Problem II. if 
and Z for Problem II. 2j) . Indeed, Meshulam [H] observed that Problem II .11 is naturally 
addressed in Fg, whereas Ruzsa |Hj saw that Problem 11.21 is particularly pleasant in 
FJf . Here, F p denotes the finite field with p elements, and F^° is our notation for a 
vector space of countable dimension over F p . 

Roughly speaking, the reason that finite field models are nice to work with is that one has 
the tools of linear algebra, including such notions as subspace and linear independence, 
which are unavailable in general abelian groups. 

Historically, questions such as Problems 11.11 and 11.21 were investigated in their original 
settings, and it was observed only later that analogous arguments worked in the finite 
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field setting and in fact looked rather simpler. More recently, there has been a trend in 
the opposite direction. This has been fuelled by an idea of Bourgain JU] which, suitably 
interpreted, can be viewed as a way of converting arguments in the finite field setting to 
arguments which work for an arbitrary group G by using a kind of "approximate linear 
algebra". The author [26J produced a result about sets of integers with few solutions 
to x + y = z which would have been very difficult to attain without first considering a 
finite field model, and more work of this sort is in progress. It is an interesting feature 
of many problems that progress for the groups G which are "of interest", such as Z/iVZ, 
is scarcely simpler than for general abelian G. 

The format of this article is as follows. After setting up a little notation and a few 
definitions, we will discuss a number of finite field problems of "Szemeredi type", that 
is to say along the lines of Problem 11.11 We will strive for a uniform treatment of three 
such problems: 3-term APs O, right-angled triangles and 4-term APs O- We 
will discuss a fourth problem in JEl which concerns solutions to x + y = z and is in a 
somewhat similar spirit. 

After these four sections we will, in ^HJ sketch an argument of Bourgain, which is 
currently being developed by the author and others, including T.Tao and I. Shkredov, 
into a machine for converting arguments in the finite field setting into arguments that 
work in any finite abelian group G. This is often of some interest when G = Z/iVZ, 
because in that case it is often possible to infer results concerning the integers. 

After that there follow three further sections of a somewhat miscellaneous nature dealing 
with finite field analogues of problems in additive number theory. 

Since this is a survey article we have not gone into a great deal of technical detail. There 
are, however, two areas we discuss which are not well covered in the literature. Thus on 
the author's webpage one may find two supplementary documents I3()j. The first of 
these gives details of the finite field version of Shkredov's argument, which is outlined 
in §3 The second supplies proofs for the result of Ruzsa discussed in £ fTUl 

Our scope in this article is a little limited, in that our main interest is in additive 
combinatorial problems which can be usefully studied in F™ for fixed p, regarding n as 
a variable parameter. Secondly, I have unashamedly prioritised areas in which I have 
personally worked. There are most assuredly other areas of mathematics where finite 
field models have proved invaluable, such as the study of the Kakeya and restriction 
phenomena. We do not touch upon these matters here, referring the reader instead to 
the article as well as in the surveys [S3 EH1 EH] • 



2. Notation and Basic Definitions 

Let p be a prime (p will be either 2,3 or 5). Write ¥ p for the finite field with p elements, 
which may be identified with Z/pZ, and for an integer n ^ 1 write F™ for a vector 
space of dimension n over W p . This will be understood to have been given to us with a 
fixed basis (ei, . . . , e n ), relative to which we will occasionally write a given a; 6 FJJ as a 
coordinate vector (xi, . . . , x n ). We will always write N = p n for the cardinality of the 
space F". 
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Once we have a basis the Fourier transform of a function / : F™ — > C can be written 
down in a concrete form. A complete set of characters 7 : F™ — > S* 1 is given by the maps 
7f, defined by 

where (gF™ and to = e 2m / p . Thus, for any £ G F™, we define 



/(e) :=^/(x)#) = ^/(x)^. 

a; a; 

We may also write this as / A (£) on occasion. The basic facts concerning the Fourier 
transform are summarised in the following lemma. 

Lemma 2.1 (The Fourier Transform). Let f, g : F™ — > C be two functions. Then 

(1) 7(0) = £»/(*); 

(2) (Plancherel) Ex f( x )9(x) = N' 1 £ ? /(£)?(0/ 

(3) (Inversion) /(*) = iV" 1 E 5 /(CV^*/ 

(4) (Convolution) Write (f*g)(x) = 51 y f(y)g(x-y). Then (f * g) A (0 = f(0g(0- 

Very often, we will be concerned with functions / which are the characteristic functions 
of sets A C F™. It is very convenient to abuse notation and write A(x) for such a 
function. Thus A(x) = 1 if x 6 A, and A(x) = otherwise. This notation is by now 
reasonably widespread in the literature, as are alternative notations such as \a or 1a- 

It will be very convenient to use the language of conditional expectation. Suppose that 
x is a variable or set of variables, and that / is a real-valued function of x. Then we 
write 

E(f(x)\xeB) HBI" 1 ^/(ar) 



xeB 



for the average of f(x) over all x G B. 



Let us conclude with some notation concerning sumsets. If G is an abelian group and 
if A, B C G then we write i + 5 = {a + &:a6i4,l)6 B}. For any positive integers 
s, t we write + tB for the set of all sums a\ + • • • + a s + b± + • • • + &t,and — t-B for 
the set of all sums a\ + • • • + a s — b± — ■ ■ ■ — bt- 

3. Uniformity 

A notion which will feature repeatedly in this article is that of uniformity, also referred 
to in various related guises as regularity, pseudorandomness or quasirandomness. 

Definition 3.1. Let A C F™ be a set, and let 77 G (0, 1) be a parameter. We will say 
that A is 77-uniform if 

sup I %) I ^ V N - 

Observe that if A is ^-uniform then it is also //-uniform for all rf ^ rj. 

The basic philosophy behind this definition is as follows. A truly random set A (gen- 
erated, say, by including each x G F£ in A independently at random with probability 
1/2) will be ^-uniform with very high probability. In fact, using a large deviation esti- 
mate such as Chernoff 's bound (see [5] for example) one can show that this is true even 
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for T) = N~ l l 2+t . A truly random set will have many other properties almost surely. 
Remarkably, many of these are consequences of A being //-uniform. This phenomenon 
was investigated in the context of graphs by Thomason |fi2"t IfiH] and by Chung, Graham 
and Wilson ^3]. Chung and Graham later defined quasi-randomness for subsets of 
Z/iVZ. Quasirandomness has been most thoroughly explored in the context of graphs, 
for which the reader should consult the excellent survey articles |381 139j . The notions 
of uniformity in Z/iVZ and in F™ differ in little more than notation. 

As an example of uniformity/quasirandomness at work, and to get comfortable with 
the notation, let us prove that uniformity is more-or-less equivalent to a combinatorial 
condition involving M(A), the number additive quadruples in A (solutions to d\ + = 
a 3 + a 4 , di E A). 

Lemma 3.2. Let ACFJJ have cardinality aN. 

(1) Suppose that A is rj-uniform. Then M(A) ^ (a 4 + r] 2 a)N 3 . 

(2) Suppose that M(A) ^ (a 4 + e)iV 3 . Then A is e 1 ^ -uniform. 

Remark. An easy application of the Cauchy-Schwarz inequality confirms that M(A) ^ 
a 4 A^ 3 , so this lemma concerns sets with close to the minimum number of additive 
quadruples. 

Proof. The proof of this Lemma rests on the identity 

M(A) = N- 1 J2\A(0W 

which may be proved by observing that M(A) = ^ X {A * A)(x) 2 and using Lemma f2. II 

(2) and (4). To prove (1), assume that A is 77-uniform, so that |^4(£)l ^ f° r a U £ 0- 
Then we have 




= « 4 AT 4 + sup|2(0| 2 -«AT 2 ^ (a 4 + rfo)N\ 

as required. To prove (2), assume that M(A) ^ (a 4 + e)N 3 . Then for any £ 7^ one 
has 

|%)| 4 ^ ^|%)| 4 -|1(0)| 4 = NM(A) — |A| 4 < eiV 3 , 

which is what we wanted to prove. □ 

We observe that if A C F" and if H + g is a coset of some subspace H ^ F™, then there 
is a natural notion of what it means for A to be 77-regular relative to H + g. Indeed we 
may define a set A~^ 9 C H by setting 

A + H 9 {x) = A(x + g) 

for x G H . Since if is a subgroup, it is isomorphic to F™ for some m ^ n and it makes 
sense to talk about the Fourier transform on H . We say that A is ^-uniform on H + g 
if A^ g is ^-uniform, considered as a subset of H. 
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The key reason for uniformity being so important to us in the present survey is that 
it allows us to count solutions to certain linear equations in sets which are sufficiently 
uniform. Lemma f3. 21 was of course a rather special example of this (the linear equation 
being a± + a<i = a% + 04). The next proposition illustrates this further. 

Proposition 3.3. Let p be a prime and suppose that A C F™. Suppose that Ai, . . . , Xk, 
k ^ 3, are non-zero integers coprime to p. Let H ^ F™ be a subspace, and let g\, . . . , g k G 
Fp satisfy ^ = 0. Suppose that the density of A on H + gi is a i} and that A is 
rj-uniform. Then M, the number of solutions to A^aj = with G H + g^ for 
i = 1, . . . , k, satisfies 

\M-a x ... a k \H\ k - x \ < V^Vi • • • a fc ) 1/fc |#f 
Proof. With the notation introduced above we can write 

heH 

This can be written in terms of the Fourier transform on H as 

c 

Separating off the term £ = and bounding the other term using Holder's inequality, 

we get 

\M-a 1 ...a k \H\ k - 1 \ ^ [Hi' 1 J] \A^(XiO ■ • .^*(A*0I 

< sup 1 n^(«r- a/fc ■ n(E i^r (M)i 2 ) iA 

T T T 

This concludes the proof. □ 

Of particular importance to us will be two cases of the above with k = 3: (Ai, A2, A3) = 
(1, 1, —2), which corresponds to arithmetic progressions of length 3, and (Ai, A 2 , A3) = 
(1, 1, —1), corresponding to what are known as Schur triples (solutions to x + y = z). 

A particularly nice feature of finite fields is that the notion of a set A C F™ being 
uniform is closely related to that set being well-distributed in cosets of codimension one 
hyperplanes. We will use this principle several times in the sequel, so let us state and 
prove a quantitative version of it now. 

Lemma 3.4. Suppose that A C F™ is a set of size aN (N = p n ) and that A is not 

rj-uniform, so that there is £ 7^ with \A(£)\ > rjN. Let H = (£) _L , and write h(x) = 
H{x)/\H\. Then 

(1) E(A*h(x) 2 ) ^ a 2 + r] 2 ; 

(2) suv x A*h{x) + 

(3) supj. A * h(x) ^ a + rj/2. 
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Remark. A * h(x) is the density of A on the coset H + x. 
Proof. To prove (1), observe that 

N ^ A * h(x) 2 = £|A( 7 )| 2 |%)| 2 
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^ \A(OW\h(OW + \A(0\ 2 \h(0\ 
^ (a 2 + r] 2 )N 2 . 

Statement (2) is a simple corollary of this: 

aN sup A * h(x) = I ^ A * h(x) J sup A * h(x) ^ ^ A * h(x) 2 . 

X \ I x 

\ X / X 

Statement (3) is proved by working directly with the definition of A(£). It leads to 
somewhat better qualitative bounds than (2). 

Let, then, H + Xj, j = 0, 1, . . . ,p — 1, be a complete set of cosets of H. Then 



J 



where aj = \A D #j| — a|if|. Thus |a.j- 1 ^ r/A/". Observe, however, that Y2 a j = 0> 
it follows that ^ - |aj| + a,j ^ 77 Af, and whence from the pigeonhole principle that 
| a,- 1 + aj ^ rjN/p for some j. For such a j, we have a,- ^ rjN/2p. □ 



4. Roth's Theorem and the iteration method 
Let us begin by recalling Problem ll.il 

Problem 11.11 What is the cardinality of the largest subset of {1, . . . , iV} containing 
no three distinct elements x, x + d, x + 2d in arithmetic progression? 

This question was first raised by Erdos and Turan in 1936 ^Hj, and was addressed by 
Klaus Roth jJOj- Define rs(N) to be the answer to Problem 11.11 Roth proved that 
rs(N) <C Nj log log N, a bound which was improved to N(\ogN)~ c independently by 
Heath-Brown and Szemeredi [SZj, and then to r 3 (iV) <C iV(loglog N/logN) 1 / 2 by 
Bourgain ^Hj- We are still a long way from a complete understanding of rs(N); the best 
known lower bound is Behrend's [S] 1946 example showing that r 3 (N) ^> iVe _c ^ log7V . 

It is natural to define r^G) for any group G with no 2-torsion (though see [40J). A 
particularly appealing case, which fits with the discussion of this article, is G = Fg. In 
this case it turns out that the four proofs [TUJ EHl HH EI] can all be adapted to give the 
following result. 

Theorem 4.1. We have r 3 (F") < N/\ogN (= 0(3 n /n)). 

In fact, all four proofs look the same in the finite field setting. Roth's proof was adapted 
to the finite field setting by Meshulam [3T] and the argument we give to prove Theorem 
14.11 is the same as his. 
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There are two key ingredients. The first is a special case of Proposition 13.31 asserting 
that if A is sufficiently uniform then we can count solutions to the equation a\ + a 2 = 2a% 
(that is, arithmetic progressions of length three). 

Lemma 4.2. Suppose that iCIj has cardinality aN, and that A is rj-uniform. Then 
there are at least (a 3 — rja)N 2 solutions to the equation a± + a 2 = 2a^ with a» G A. In 
particular if r\ = a 2 /2 and N > 2/ a 2 then A contains a 3-term AP (x, x + d, x + 2d) 
with d ^ 0. 

Proof. The first part is just a matter of setting H = F3 and (Ai, A 2 , A 3 ) = (1, 1, —2) in 
Proposition EUD To verify the second statement, one must simply check that if r] = a 2 /2 
and N > 2/a 2 then (a 3 — rja)N 2 is greater than aN, the number of "trivial" 3-term 
APs (x, x, x) in A. □ 

The second key ingredient is Lemma 13.41 (3), which asserts that if A is not ^-uniform 
then it has increased density on some coset of a hyperplane. In combination with Lemma 
14.21 this leads naturally to an iterative method for proving Theorem 14.11 

Proof of Theorem \4-l\ Set Aq = A, Hq = F3, cto = a. For each i — 0, 1, . . . we perform 
the following algorithm: 

• If Ai is af/2-uniform then STOP. 

• Otherwise by Lemma f3. 41 find a hyperplane Hi + \ ^ Hi and an x G Hi such that 
\Ai n (H i+1 + x)\ > (ai + a 2 / A)\H i+l \. Now set A i+1 = (Ai - x) n H i+1 and set 
a l+1 = \Ai n (H l+1 + x)\/\H i+ i\. 

Note that if Ai contains a 3-term AP then so does A. 

The algorithm cannot be repeated forever, since the sequence (0^)°^ satisfies ao = a 
and ctj+i ^ ai + a 2 / 4 then a, > 1 for i > 50/ a. Thus we reach a STOP at step 
K of the algorithm, for some K < 50/a. At this stage, Ak is a^/2-uniform. If in 
addition \H%\ > 2ja\ then, by Lemma l4~2"| Ar- contains a 3-term AP. Since \Hk\ = 
3~ K N > 3~ 50 ' a N and > a, we see that the original set A contains a 3-term AP if 
a > C I log for some C. □ 

We call the above an iteration argument for obvious reasons. We will encounter several 
such arguments in this survey, so let us take the opportunity to look at the important 
features of it. 

Our concern was with certain configurations Config, which in this section were the 
three-term arithmetic progressions (x, x + d, x + 2d), d 7^ 0. 

A key feature of the argument was a collection Struct of structures, which in this case 
was the collection of all cosets of subspaces of F3. There was also some measure of 
the complexity uj(S) of a given structure S G Struct, this being the codimension of 
the subspace. For a given set A C F3 and for any S G Struct there was a notion 
of the density 5s (A) of A relative to S. Finally, there was a norm || ■ \\$ on functions 
/:£?—>• [—1, 1], for any S G Struct (in the example above, this was the L°° norm of the 
Fourier transform of /, regarded as a function on S). This we used to define a notion 
of uniformity relative to some S G Struct; a set was //-uniform if \\A — 5s(A)\\s ^ r). 
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The "iteration step" of Roth's argument can be presented in the following way. 

Let S G Struct, and let A C F3 be a set with 63(A) = a. Then one of the following 
three alteratives holds: 

(1) (generalised von Neumann theorem 2 ) \\A — 8s(A)\\s ^ « 2 /2, in which case A 
contains some c G Config; 

(2) (density increment) \\A — 8s(A)\\s > a 2 /2, in which case we may find S' G 
Struct, cu(S') ^ u(S) + 1, such that 6 3 >(A) ^ S S (A) + a 2 /4; 

(3) (endpoint) \S\ < 2/a 2 . 

Several subsequent arguments will have the same general form, with different notions 
of Struct, Config, u and || • \\s- The choice of Struct and, perhaps more importantly, 
of the norm || • \\$ is vitally important. || • ||g must be "strong" enough for us to be able 
to prove a von Neumann theorem, yet "weak" enough that one may obtain a density 
increment. 

To conclude this section, let use return to the question of estimating ^(Fg), which I 
regard as a very interesting one. It seems to dramatically expose our lack of under- 
standing of 3-term arithmetic progressions. There does not seem to be an analogue of 
Behrend's example in the finite field setting (Behrend's construction makes important 
use of convexity in ~R n ). The best known lower bounds on ^(Fg) come from design 
theory, where a set in F3 with no 3-term AP is known as a cap. Write f(n) for the 
cardinality of the largest cap in F3. In [Tj)| one finds the estimate 

/i(3) := limsup l^Cfw) ^ . 724851, 

which seems to be the best known. In that paper it is stated as an interesting research 
problem to determine if /i(3) = 1.1 believe that this is not so. 

Conjecture 4.3. /Lt(3) < 1. That is, there is an absolute constant 5 > such that 
r 3 (F") ^ (3-5) n . 

I would expect any methods used to make progress on this conjecture to assist with the 
Problem 11.11 At present the best known bound is that given in Theorem 14.11 

5. Right-angled triangles - an argument of Shkredov 
In this section we write V n = and N = \ V n \ =2™. 

We are concerned with a sort of two-dimensional generalisation of Problem 11.11 

Problem 5.1. What is r^(N), the cardinality of the largest subset of {1, ...,N} x 
{1, . . . , iV} containing no corner ((x, y), (x + d, y), (x, y + d)), d 7^ 0? 

Ajtai and Szemeredi [2] proved that r z (N) = o(N), and various subsequent authors 
[331 |Mj have obtained explicit bounds of the shape r^(iV) N/ilog^ N) c . Here log* iV 
is the number of times one must take the logarithm of N in order to produce a number 
less than 2. 

2 This term is one that Tao and I are trying to popularize to emphasise the connection with results in 
ergodic theory such as ^0 Lemma 3.1]. Such results tend to be established using several applications 
of the Cauchy-Schwarz inequality - see for example |33l §5]. The phrase "key lemma" was used for a 
related concept in the theory of graph regularity in the excellent survey |39j : now the more descriptive 
term "counting lemma" is popular (cf. |231 1261 B3J L 
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Very recently Shkredov produced the first "sensible" bound 

r z (N) < A7(logloglogiV) c . 

In this section we give the finite field version of his argument, in which the details are 
greatly simplified. 

Let G be an abelian group of size N, and consider the collection of corners in G x G, 
by which we mean triples ((x,y), (x + d,y), (x,y + d)), d ^ 0. Write for the 

cardinality of the largest set A C G x G which does not contain any corner. 

Theorem 5.2 (Shkredov). We have r z (¥ r 2 l ) < N 2 / (log logiV) 1 / 25 . 

It is natural to try and use the iteration method, in the form outlined in the previous 
section. The most naive attempt at doing this would involve taking Struct to be the set 
of cosets of products HxH, where H ^ ¥ 2 is a subspace, and the definition of uniformity 
to be much the same as before. The notion of having no large Fourier coefficients makes 
perfect sense in H x H . Unfortunately, however, this notion of uniformity is not subtle 
enough to give good control on the number of corners, essentially because it does not 
"see" the coordinate structure of H x H. The following example is instructive: 

Example 5.3. Let B be a random (and hence highly uniform) subset of V n with car- 
dinality f3N, and let A C ¥ 2 x ¥ 2 be the set B x B. Then A is also highly uni- 
form. The density of A is a = f3 2 . A corner in A corresponds to a quadruple of 
points (x, x + d,y,y + d) G B 4 , and we know from Lemma 13.21 that there are roughly 
f3 4 N 3 = a 2 N 3 such configurations. If A were truly random, however, it would have 
more like a 3 N 3 corners. 

The next idea, then, might be to define a somewhat finer notion of uniformity which 
respects the coordinate structure somewhat more. Using Proposition 13 .21 as a guide, we 
might define A C ¥ 2 x ¥ 2 to be rectilinearly r)-uniform if the number of configurations 
((x, y), (x+d, y), (x,y + e), (x + e,y + e)) in A 4 is at most (a 4 + r])A^ 4 . Such a notion does, 
as we will see, give some control on the number of corners in A. Unfortunately passing 
to a new structure S G Struct on which the density increases is now problematic. 

To see why, consider again example ()5.3|) . It is easy to see that A fails to be rectilinearly 
uniform, but there is no product set (H + x) x (H' + x'), H, H' large subspaces of Fj, 
on which the density of A increases markedly. 

Note, however, that in this example there is at least some structure on which the density 
of A increases, and that is the product set B x B (of course, the density of A on this set 
is one). This behaviour is more-or-less typical: if a set A C F2 x Fg has substantially 
more than a 4 iV 4 rectangles then it has increased density on some product B\ x B 2 . 
This can be proved graph-theoretically by associating to A the bipartite graph Ta with 
vertex sets two copies of F£ , an edge xy being deemed to lie in Ta precisely if (x, y) G A. 
A rectangle in A then corresponds to a copy of C4 in Ta, and we are reduced to showing 
that if has substantially more than a 4 iV 4 copies of C4 then there are large vertex 
sets Bi, B 2 such that the edge density of Ta restricted to B\ U B 2 is much greater than 
a. Shkredov in effect provides a spectral proof of this statement, which is in the spirit 
of JH]- A purely combinatorial proof is more traditional, and somewhat simpler - the 
details may be found in [2*§] . 
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The discussion of the last paragraph might suggest that we should enlarge Struct to 
include all translates of products B x B. This turns out to be too much of a compromise 
- one cannot establish a useful generalised von Neumann theorem. 

The above discussions motivate Shkredov's main advance, which is an appropriate def- 
inition of Struct. The definition depends on the global density a of A, a feature which 
has no analogue in other applications of the iterative method discussed in this paper. 

Definition 5.4. Let a > 0. Define Struct^ to consist of all translates of product sets 
S = Ei x E2, where Ex,E 2 are subsets of some H ¥ 2 , \Ei\ = (3i\H\ and Ei is a 
(2- 36 /5 1 12 /3 2 12 a 36 )-uniform subset of H for % = 1, 2. 

Definition 5.5. Suppose that S = E\ x E2 is a product set, and that f : S —> [—1, 1] 
is a function. Then we define the rectangle norm of /, ||/||s by 

||/|| 4 5 = E(f(x,y)f(x',y)f(x,y')f(x',y')\x,x' G E u y,y' G E 2 ) . 

It is not totally obvious that || ■ ||g is a norm, but this is in fact the case. Let us now look 
at how the argument fits together, starting with a generalised von Neumann theorem. 

Proposition 5.6 (Generalised von Neumann). Let S G Struct a; so that S = E\ x E2 
be a product set, where Ex,E 2 C H, \Ei\ = (3i\H\ and Ei is (2 -36 (3\ 2 (3 2 2 a 36 ) -uniform for 
i = 1,2. Let A C S be a set with Ss(A) ^ a. Suppose that and that \\A — Ss(A)\\^ ^ 
2~ 8 a 12 . Then A has at least a 3 Pf(3 2 N 3 /2 corners. 

The proof of this statement involves a number of applications of Cauchy-Schwarz. 

To complement the generalised von Neumann theorem, we must establish a density 
increment result. The following can be obtained by simple graph theory (or alternatively 
by spectral methods, as done in 

Proposition 5.7 (Density increment on a product set). Let S = Ex x E 2 be a product 
set, and suppose that A C S has d~s(A) = a and \\A — a||| ^ 77. Then there are sets 
Fi C Ei with ^ 2~ 8 r]\Ei\ such that the density of A on S' = Fx x F 2 satisfies 
5 S >(A) >a + 2~ u rf. 

Remark. There is no need to assume that the sets Ei, E 2 are uniform in this proposition. 
Proposition 15.71 has a significant deficiency, which means that it cannot be used in 
combination with Proposition 15.61 to provide an iterative proof of Theorem 15.21 This is 
that the sets Fi,F 2 which it outputs need not be uniform, and so it is quite possible 
that S' ^ Struct^. The following further result is required. 

Proposition 5.8 (Uniformising a product set). Let a,r,o~ G (0, 1) be parameters, and 
let S' = Fx x F 2 be a product set in W x W with = 5iN. Suppose that A C S' is a 
set with Ss'(A) = a + r, and that 

\W\ > exp(16(T" 2 r 1 r- 1 ). (5.1) 

Then there is a subspace W C W, dimW ^ dimly — 8cr~ 2 (5 _1 r _1 and tx,t 2 G W such 
that if E'x = {Fx - tx) n W, E' 2 = {F 2 - t 2 ) n W and S" = E[ x E' 2 then 

(1) \S"\ ^ 5xS 2 r\W'\ 2 /2; 

(2) E'x,E' 2 are 2o -uniform as subsets ofW; 

(3) 5 s »(A-(tx,t 2 )) ^a + r/8. 
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The proof of this theorem also proceeds by a version of the iterative method, and in this 
sense Skhredov's argument is a sort of double iteration method. The most important 
content of the proposition is that if S C W x W then we may pass to a translate of 
W x W on which S looks uniform, where W ^ W is a subspace of somewhat large 
codimension. If this really was our only aim, then we could proceed as follows. Either 
S is already uniform, or else S has a large Fourier coefficient £. In the latter case, S has 
increased density on some translate of ^ _L , by Lemma [3.41 (2). obviously contains a 
set of the form W x W', with W having codimension at most two. Now simply iterate 
the argument. 

The one further issue is that we also need to keep control of the density of A, which 
sits inside S. To achieve this it is necessary to partition W x W into pieces which are 
translates of products W x W', such that S is uniform on almost all of them. By a 
simple pigeonhole argument there must be some piece on which S is uniform, and on 
which the relative density of A is still quite large. Note that the subspaces W need 
not be the same for each piece; this is important from the point of view of obtaining 
bounds, or else one runs into examples such as that in §9 of [26J. 

To get this decomposition into pieces one uses the iterative argument with one small 
modification. At the jth stage of the iteration we will have a collection C\ of pieces, each 
being a translate of some product W x W. If c G Cj, write 5(c) for the relative density 
of S on the piece c. Our previous proposal was to ensure that swp ceCj 5(c) increases at 
each step of the iteration, this idea having served us well in the past. What one does 
instead is to increase the L 2 average E(5(c) 2 |c G Cj). This can be accomplised by using 
Lemma f3 .41 (1). 

Propositions 15 . 71 and 15 .81 together give the requisite density increment result to go with 
the generalised von Neumann theorem of Proposition 15.61 Thus we can employ an 
iteration argument. Working out the bounds gives Theorem 15.21 



6. Progressions of Length Four 
In this section we give another example of the iterative method at work. 

Problem 6.1. Estimate r^(N), the cardinality of the largest subset of {1, ... , N} con- 
taining no four distinct elements x, x + d, x + 2d, x + 3d in arithmetic progression? 

This question was, like Problem II .1| raised by Erdos and Turan in 1936. Szemeredi [HE] 
was the first to show that r 4 (N) = o(N). It was not until as recently as 1998 that the 
first "sensible" upper bound, r 4 (iV) iV(loglog N)~ c , was provided by Gowers |21j . 
Gowers' argument was iterative, like the arguments of 2] and 

Of course, one can define r 4 (G) for any abelian group G of size N. Recently, T.Tao and 
the author studied the case G = F^, starting from Gowers' work. Certain features 
of [21] become rather simpler in this setting, and we were able to run the iterative 
method quite efficiently, obtaining the following theorem. 



Theorem 6.2 (G.-Tao). r 4 (F") < iV(logA^)~ c for some c> 0. 
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Write Config for the collection of all four-term progressions in Fj?. Any hope of proving 
a generalised von Neumann theorem with the same uniformity norm that we used in 2] 
is dashed by the following example: 

Example 6.3 (Gowers; Furstenberg- Weiss). There is a set A C F£ with density 1/5, 
which is highly uniform, but which does not contain roughly 5~ 4 N 2 four-term arithmetic 
progressions. 

Proof. Let A = {x G Fg : x T x = 0}. Then A certainly has density approximately 1/5. 
To see that A is highly uniform, write 

^) = \ E E ^ xTx ~ ex ^EII-- A "' 

AeF 5 zeF™ A j=l 

If A then each term in the product has magnitude v^5, giving a total contribution 
of 5 n/ ' 2 ; if A = then, provided £ 7^ 0, at least one term in the product vanishes. It 
follows that sup^ \A(£)\ ^ 5 n / 2 = ^N. 

However, A has roughly 5 _3 iV 2 progressions of length four. Indeed, since A is so highly 
uniform we know from Proposition 13.31 that it contains roughly this many progressions 
of length three. However if x, x + d and x + 2d all lie in A then x + 3d e A automatically, 
in view of the easily verified identity 

x T x - 3{x + d) T (x + d) + 3(x + 2d) T {x + 2d) - (x + 3d) T (x + 3d) =0. □ 

Remark. Gowers has shown us an example of a subset of Z/iVZ which is uniform and 
has density a, but has many fewer than a 4 N 2 four-term arithmetic progressions. 

Similar examples can be constructed using any quadratic form q(x) = x T Mx + r T x + b 
in place of x T x. Remarkably, there are essentially no other examples. We shall formalise 
this statement in what follows. 

Definition 6.4 (Gowers norm). Let / : F5 — > [—1, 1] be a function. Then the Gowers 
?7 3 -norm of /, \\f\\uh is defined by ||/||^ 3 = 

E(f(x)f(x+a)f(x+b)f(x+c)f(x+a+b)f(x+a+c)f(x+b+c)f(x+a+b+c)\x, a, b, c). 

(6.1) 

Again, it is not completely obvious that || • ||[/3 is a norm, but this is not to hard to 
show. The following result is due to Gowers |21j . As with the other generalised von 
Neumann theorems we have mentioned, the proof involves several applications of the 
Cauchy-Schwarz inequality. 

Theorem 6.5. (Generalised Von Neumann theorem) Suppose that A C F5 has density 
a, and that \\A — a\\jj3 ^ <5 4 /100. Then A has at least a 4 N 2 /2 four-term arithmetic 
progressions. 

The next theorem is proved in [34J by adding a single new idea, the so-called "symmetry 
argument", to the ideas of Gowers [21]. This theorem clarifies the sense in which the 
"quadratic" examples of Furstenberg and Weiss are in a sense the only ones: 

Theorem 6.6 (Gowers; G.-Tao). Suppose that \\A — a\\u3 ^ 5. Then A has quadratic 
bias, meaning that there is some quadratic form q(x) = x T Mx + r T x + b such that A 
has density at least a + C(5) on the zero set S = {x : q(x) = 0}. 
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The reader may note that these two theorems do not, in their present incarnations, 
dovetail together to give an iteration argument because there is no natural definition 
of Struct. Roughly speaking, Gowers took Struct to be the collection of translates of 
subspaces of F£. It is possible to deduce from the conclusion of Theorem 16 . 61 that A has 
increased density on some S G Struct, but unfortunately the codimension u(S) might 
be exceedingly large (perhaps n — n 1 / 100 ). This does not, then, lead to a very efficient 
iterative argument. 

In [S3] a much less appetising approach is forced to work, which leads to superior bounds. 
Roughly, this involves taking Struct to be the collection of all quadratic submanifolds, 
our name for an intersection 

A: 

S=f]{x: gj (x)=0}, 

3=1 

where qj(x) = x T MjX + rJx + bj are quadratic forms. The "roughly" is quite important. 
We must in fact assume that S is "generic" , meaning that the matrices Mj are not too 
linearly dependent. In practise this means that they satisfy a rank condition such as 
rk(AiMi + - • - + AfcMfc) ^ 10k for all possible choices of scalars Xj G F 5 . We also allow our 
quadratic forms to be defined only on a subspace W ^ Fg of not-too-large codimension. 
This is because of the very useful observation that an arbitrary quadratic submanifold 
can be made generic after passing to an appropriate subspace W. 

Generalising the notion of Gowers [7 3 -norm to such a setting is straightforward; in fact 
the definition is the same except that the expectation in (j6.1|) is taken over S. Proving 
an analogue of Theorem 16.51 is substantially more involved, but it is possible and reads 
as follows. 

Theorem 6.7. Let S G Struct. That is to say, S is a generic quadratic submanifold 
in some W ^ Fg ; this being the zero set of some k quadratic forms qi, ■ ■ ■ ,qk on W . 
Then S has approximately \S\ 3 /\W\ four-term arithmetic progressions. 

(1) (Generalised von Neumann theorem) Suppose that ACS has density a, and 
that \\A — a||[/3(5) ^ a 20 . Then A has at least a A \S\ 3 /2\W\ four-term arithmetic 
progressions. 

(2) (Gowers-type inverse theorem) Suppose that \\A — a\\us(s) 5. Then A has 
quadratic bias, meaning that there is some quadratic form qk+i(x) = x T Mk+ix + 
r^+iX + bk+i such that A has density at least a+C(S) on the set Sn{x : qk+i(x) = 
0}. 

A key feature of the theorem is that the density increment C (5) is independent of the 
number of quadratic forms k. The proof of the theorem is long and somewhat difficult 
and occupies the bulk of pEfj . 

Theorem 16.71 of course allows one to set up an iteration scheme. If A C Fg is a set with 
density a which contains no four-term progressions, then one may find a sequence 

of generic quadratic manifolds, defined on subspaces 

F™ = Wo ^ W l ^ W 2 ^ . . . 
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such that the density of A on Sj is at least a + jC(a 20 ). This leads to a contradiction 
after C^a 20 )' 1 iterations. 

Unfortunately, this still leads to a bound of the shape r^F??) <C iV(loglog N)~ c , since 
we have only been able to establish Theorem 16.71 with a function C(5) which behaves 
like exp(— 5~ B ), and this results in a very large number of iterations. We conjecture 
that a better bound holds, but we cannot prove this even in the less general context of 
Theorem 16.61 I regard this as one of the key open questions in this area of arithmetic 
combinatorics. 

Conjecture 6.8 (Polynomial Gowers Inverse Conjecture). Let f : Fg — > [—1,1] be a 
function with E/ = 0. Suppose that \\f\\u 3 ^ 8- Then there is a quadratic form q on Fg 
such that 

\Ef(x)w q W\ >5 C , 

for some absolute constant C . 

We do know this with 5 C replaced by a function of exponential type. An affirmative an- 
swer to the PGI conjecture would be implied by an affirmative answer to the Polynomial 
Freiman-Ruzsa conjecture (PFR), which is discussed in some detail in < J1UI 

Fortunately, for the purposes of obtaining a bound on r^Wg) one can get by with a 
weaker conclusion in Theorems 16.51 and 16.61 In Theorem 16.51 one can obtain a "poly- 
nomial" density increment, leading to a much shorter iterative process, by passing to a 
set of the form {x : q(x) = 0} PI (W + 1), where W C F^ is a subspace. One can allow 
the codimension of W to be a power of a -1 , which is just as well since this is the best 
bound we have. 

7. Szemeredi Regularity in Groups 

The object of this section is to state some results and open problems from (2^]. The 
results are slightly different from those in the previous section in the problem addressed 
is not quite of "Szemeredi type". However what we discuss here is certainly in a similar 
spirit, being concerned with solutions of linear equations in sets of integers, and can 
furthermore be interpreted as an application of the iteration method. 

We will be somewhat brief: more details can of course be found in the paper [2H] itself, 
which is written from a viewpoint rather similar to that of the present survey. 

Szemeredi's regularity lemma is a famous result in graph theory. It can be regarded as 
structure theorem for all graphs, in the sense that it shows that one can decompose a 
completely arbitrary graph into a bounded number of pieces, almost all of which are 
pseudorandom. There are many excellent articles on this topic - see for example |39j . 

One consequence of Szemeredi's regularity lemma is the following interesting result. 3 

Theorem 7.1. Let T be a graph on N vertices, and suppose that one must remove 
5N 2 edges from G in order to destroy all triangles in T. Then F has at least Ci(S)N 3 
triangles, for some C(5) > 0. 

3 We have not attributed this result, as it is not clear to us where it was first stated. A slightly 
weaker result was obtained by Ruzsa and Szemeredi in 1976 [52] . The result is also well-known in the 
literature concerning "property testing": see, for example, 
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Put another way, if a graph is almost triangle-free (i.e. contains few triangles) then it 
can be made truly triangle-free by the removal of a small number of edges. 

Our investigations in were motivated by an "arithmetic" question related to the 
above theorem. 

Theorem 7.2 (See |2Ej)- Let G be an abelian group of size N , and suppose that A C G 
is a set. Suppose that one must remove 5N elements from A in order to create a sum- 
free set (that is, a set with no solutions to x + y = z) . Then A has at least C2(S)N 2 
Schur triples (triples (x, y, z) for which x + y = z). 

This result may be regarded as a structure theorem for sets which are almost sum-free; 
they can be made truly sum-free by the removal of a few elements. 

This theorem is deduced from a result which we call a Szemeredi-type regularity lemma 
for abelian groups. This result is a perfect example for the present survey, since in the 
context of a general abelian group it requires substantial preparation to even state the 
result. When G = however, things are much easier. 

Theorem 7.3 (Regularity lemma for FIJ). Let A C F£ be a set, and let e > be a 

parameter. Then there is a subspace H C F£ with codimension at most M(e), and such 
that A is e-uniform on at least a proportion 1 — e of the cosets of H . 

Let us say, for the rest of this section, that A is e-regular relative to H if it satisfies the 
conclusion of this theorem. 

Let us sketch the deduction of Theorem 17.21 from Theorem 17.31 Suppose that A C F£ 
is a set with the property that one must remove at least 5N elements from A to leave 
a set which is sum-free. Apply Theorem 17.31 with e = (<5/10) 3 , giving a subspace H of 
codimension at most M(e) such that A is e-uniform for a proportion at least 1 — e of 
the cosets of H. For each coset H + x, we ask two questions: 

• Is A e-uniform on H + x? 

• Is the density of A on H + x at least (2c) 1 / 3 ? 

If the answer to either of these questions is no then we simply remove all of A D (H + x) 
from A. Let the set remaining after we have asked the above questions for all cosets 
H + x be called A'. It is easy to see that 

\A'\ > \A\ - 10e 1/3 N = \A\-SN. 

We claim that A' is sum-free. Indeed, were it not there would be xi, X2, x% with x\ +X2 = 
£3, such that A is e-uniform and has density ctj ^ (2c) 1 / 3 on each H + Xi. By Proposition 
13.31 this means that K, the number of solutions to a\ + 02 = 03 with <2j G A fl (H + Xj), 
satisfies 

\K — aia2a 3 \H \ 2 \ ^ e\H\ 2 , 

which means that K ^ e\H\ 2 /2. Thus certainly the number of Schur triples in A is 
certainly at least e\H\ 2 /2, which is at least e2~ 2M ^- 1 N 2 . 

The proof of Theorem 17.31 is very much in the spirit of the iterative method. One again 
takes Struct to be the collection of all subspaces H ^ FJJ , but here there is no Coring. 
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Let A C F£ be a set, and let H 6 Struct. We define the L 2 -density of A with respect 
to by 

In j2E] this is called the index, and is written ind(A; H). 

The key to the proof is the following lemma (Lemma 2.2 of 26J), which can be proved 
by elaborating somewhat on the proof of Lemma f3. 41 (1). 

Lemma 7.4. Let e E (0, |) and suppose that H F£ is a subgroup which is not e- 
regular for A. Then there is a subgroup H' ^ H such that codim(if') si 2 codim W and 
M^) ^5 H (A) + e 3 . 

Theorem 17. 31 is simply a matter of applying Lemma f7.4l iterativelv. Since (A) ^ 1 for 
any H, the number of iterations is no more than [1/e 3 ]. 

An unfortunate feature of Theorem 17.31 and its proof is that M(e) grows like a tower 
of twos of height e~ 3 . This is because each application of Lemma 17.41 results in an 
exponentiation of the codimension of H. By adapting a brilliant construction of Gowers 
[20 , which shows that Szemeredi's regularity lemma for graphs must have tower type 
bounds, we were able to show that M(e) must be at least as bad as a tower of twos of 
height about log(l/e). 

We were not able to produce a similar example in the setting of Theorem 17.21 

Problem 7.5. Find a "reasonable" bound for C*2(5), the quantity appearing in Theorem 
17. 2| or prove that no such bound exists. 

In fact for G — I am not able to exclude the possibility that C2{5) can be a polynomial 
in 5. This need not be the case for G = Z/iVZ, due to the Behrend example of a large 
subset of {1, . . . , N} containing no 3-term AP. See j2H| fo r a further discussion. 

The corresponding for graphs (relating to Theorem 17. lj) is also wide open, though again 
it is known that C\{5) cannot be taken to be polynomial in 5. 

8. From Finite Fields to {1, . . . , N} 

We have now seen several examples concerning additive combinatorics in finite fields. 
However, for many of the problems we have considered it is an analogue in {1, . . . , iV} 
or (more-or-less equivalently) in Z/iVZ which is actually of interest. 

In recent years the passage from finite fields to the integers, at least for problems 
concerning configurations of the type we have been discussing in the last four sections, 
has started to form into something resembling a theory. This is thanks to the work of 
Bourgain JU] on finding good bounds for r 3 (N). 

Bourgain's ideas are developed in detail in his original paper, of course, and have also 
been discussed in and jHOl- I n this section we restrict ourselves to a few remarks 
which illustrate the important points. 



Consider the problem of finding a bound for r 3 (G) using the iteration method, where G 
is an abelian group with order N and no 2-torsion. It is not hard to see (essentially by 
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changing the letter £ to 7 in Proposition I3.3|) that if A C G has density a and substan- 
tially fewer than a 3 N 2 3-term APs then A has a non-trivial large Fourier coefficient, 
that is to say 

%) :=^A(x) 7 (x) 

X 

has magnitude a large fraction of N for some non-trivial character 7 e G*. 

It is not immediately clear how to use this information. We can no longer assert that A 
has increased density on a subspace, because in a general group G there is no such thing 
as a subspace. What we can show, rather painlessly, is that A has increased density on 
a translate of a Bohr set, that is to say a set of the form x + B({7}, e), where 

5({ 7 },e) :={xeG: \1 - 7 {x)\ ^ e}. 

The reader who has followed the various iterative arguments in the last four sections 
might now suggest that we define Struct to be the collection of all Bohr sets B(T,e), 
where T = {71, . . . , 7^} is a set of characters and 

B(T,e) :={xeG : \l - jj{x)\ sC e for all j = l,...,d}. 

Note that in F3 a Bohr set is the same thing as a subspace when e < 1/4. Such a 
strategy is clearly not going to be without its difficulties. If S G Struct, it looks as 
though we are going to have to make some sense of what it means to do Fourier analysis 
on S. Since B(T, e) is not a group, this will certainly not be a trivial matter. 

In fact, B = B(T,e) is quite a long way from being a group. The homomorphism 
(71, . . . , 7^) : G — > T d carries B into a small d- dimensional box D. If one picks x, x' at 
random in D, the chance that x + x' G D is just 2~ d . Hence one expects that typically 
\B + B\ ~ 2 d |5|, which compares unfavourably with the result \H + H\ = \H\ which 
holds if H ^ G is a genuine subspace. 

We will not, in this survey, go into the details of what we mean by Fourier analysis on 
B, nor how the large doubling constant of B is unpleasant in this context. We hope the 
reader will believe us when we say that reducing the doubling constant is a very helpful 
thing to do. 

Bourgain's advance is to consider B not by itself, but together with another Bohr set 
B' := B(T, e'), where e' is much smaller than e . Then if x G B and x' G B' we 
have x + x' G B(T, e + e'), a set which ought not to be much larger than B. Thus 
\B + B'\ ~ \B\, and we may think of the pair (B, B') as behaving like an approximate 
group. Roughly speaking, it turns out to indeed be possible to run an iterative argument 
in which Struct is the collection of all such pairs (B, B'). 

There are a number of further technicalities to be overcome. One interesting one is that 
our assertion that B(T,e + e') is not much larger than B(T, e) is not true in general. 
Suppose, for example, that G = Fg, that the characters in T are linearly independent 
and that e < 2sin(7r/5),e + e / > 2sin(vr/5). Then \B(T, e)| = 5 n ~ d , whilst \B(T,e + e') \ = 
3 _d 5 n . Bourgain circumvents this difficulty by using an averaging argument to show 
that for a typical e the size of B(T,e) is roughly invariant under small perturbations 
of e. Tao [HD] observed that one could also replace Bohr sets by smoothed Bohr sets, 
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and then such difficulties go away. I implemented this idea slightly differently in [26J, 
defining the a smoothed Bohr "set" by 



We conclude this section by giving an up-to-date summary of the extent to which the 
problems of the last four sections have been given Bourgain's treatment. Of course, 
in the original paper jTHj the question of 7*3 (Cr) was treated (actually, Bourgain only 
treats r^(N) but it is clear that his methods work in an arbitrary G). In [2E] the results 
of ^3 are all fully generalised to any finite abelian G, and in particular Theorem 17.21 
is proved in this general setting. As regards adapting the methods of £0 to obtain 
a bound of the form rz(G) <C iV(loglog N)~ c , this ought to be possible (Shkredov, 
work in progress). Finally there is the issue of transferring the arguments of ^Hl to 
obtain a bound of the form r 4 (G) <C N(logN)~ c . In particular one would like this for 
G = Z/iVZ, which would imply that r 4 (iV) <C N(log N)~ c . Since the argument for 
r 4 (Fg ) is already rather difficult, one should not expect this to be at all straightforward. 
Even describing the correct generalisation of the notion of quadratic form to an arbitrary 
G is not straightforward [35] . 



As promised, we now move onto questions of a somewhat more miscellaneous nature. 
This section concerns the following problem. 

Problem 9.1. Let A C {1, . . . , N} be a set of size N/10 (say). Must A + A contain a 
long arithmetic progression? 

Bourgain [§| proved that the answer is "yes"; A + A must contain a surprisingly 
long arithmetic progression. If L(N, a) is the smallest I for which there is a set 
A C {1, . . . ,N} of cardinality aN such that A + A does not contain a progression 
of length I, then Bourgain showed that L(N, 1/10) 3> exp(c(logiV) 1 / 3 ). In this 
was improved to L(N, 1/10) ^> exp(c(log N) 1 ^ 2 ). An example of Ruzsa [Slj shows that 



It seems as though the natural finite field analogue of Problem 19.11 involves replacing 
"arithmetic progression" by "coset of a subspace" . 

Problem 9.2. Write D(n,a) for the smallest d for which there is A C F£ of density 
a such that A + A does not contain a coset of a subspace of dimension d. Estimate 
D(n, a). 

The techniques of j22] adapt to this situation in a straightforward manner, and one 
obtains the following. 

Theorem 9.3. Suppose that a ^ n^ 1 ^. Then D(n,a) J? a 2 n/80. 

A detailed proof of this fact may be found in [21] ■ In keeping with the philosophy of this 
survey, some of the details are rather cleaner than in the orginal argument [23] which 
applied to subsets of {1, . . . , iV}. 

A more dramatic difference between the finite field case and the original setting of 
Problem 19. II can be observed when one tries to adapt Ruzsa's construction to the finite 
field setting. 




P -t/e 

B(K,t){x) dt. 



9. Progressions in Sumsets 



L(N, 1/10) < exp(c e (logiV) 2 / 3+£ ). 
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Theorem 9.4 (Ruzsa's niveau sets in W%). D(n, 1/4) $C n — y/n. 

Proof. Let A be the set of all vectors i G with at least n/2 + y/n/2 ones with 
respect to the standard basis. By the central limit theorem the number of ones in a 
random vector (x\, . . . , x n ) is roughly normally distributed with mean n/2 and standard 
deviation y/n/2, and so for large n we have \ A\ ^ 2 n ~ 2 . Now any vector x E A + A must 
have at least y/n zeros. Using this fact, we shall prove that A + A meets all translates 
of all (n — L\/^-J )-dimensional subspaces. Indeed, write d = [\/n\ and suppose that U 
is a translate of some subspace of dimension n = d. U can be written as 

U = {a + Aiai H h \ n -dan-d ■ A, G F 2 } , 

where the are linearly independent. Write aj in component form as (ap^)™ =1 . The 
column rank of the matrix (a^) is n — d, and hence so is the row rank. Without loss 

of generality, suppose that the first n — d rows (a^, . . . , a„_ d ), j = 1, . . . ,n — d, are 
linearly independent. Then we can solve the n — d equations 

a + Al<4^ + ■ • ■ + X n -d a n-d = 1 

for the Aj, giving a vector in {7 with no more than d zeros. □ 

Problem 9.5. Narrow the gap between Theorems 19.31 and 19.41 

My suspicion is that the upper bound of Theorem 19.41 is closer to the truth. 

I cannot resist mentioning two problems which were raised at the AIM conference on 
additive combinatorics. The first is due to Croot: 

Problem 9.6. Fix 6 G (0, 1). What is 

1(9) = limsup min (length of the longest progression in A + A)l 

N^oo AC{N],\A\=m-<> 

In words, we are interesting in finding subsets A C [N] with density N~ 6 such that A+v4 
contains no long arithmetic progression. Croot states that the bounds 2/0 — 1 ^ 1(9) 
2 1 /® are known. The upper bound comes by considering a multidimensional progression 
of dimension about 9 log 2 iV: it would be interesting to see whether a construction 
related to niveau sets gives anything better. 

The second question is due to Katznelson: 

Problem 9.7. What is the measure of the largest open subset A of the torus T d for 
which A — A does not contain a 1-dimensional subgroup? In particular, is it 2~ d l 

10. Freiman's Theorem 

A great deal of the material in this section was communicated to me in person by 
Imre Ruzsa, and is reproduced here and in the supplementary document [30J (which 
contains proofs) with his kind permission. The reader will also wish to consult Ruzsa's 
own survey article [o 1 . as well as the material from the AIM conference on Additive 
Combinatorics pQ. 

This section concerns Problem 11.21 of the introduction. Let A C have doubling at 
most K, meaning that we have the inequality |v4 + v4| ^ -f^|^4|- What can be said about 
the structure of A? 
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It is hard to think of any examples of sets A with this property other than cosets of 
subspaces, and large subsets of them. In fact, these are the only such examples as was 
shown by Imre Ruzsa >47]. The best known bounds for a result of this type are due to 
Ruzsa and the author [3T] : 

Theorem 10.1 (Freiman's theorem in Ff 3 ). Let A C F£° be a finite set with \A + 
A\ ^ ^|^.|- Then A is contained within a coset of some subgroup H ^ F£° with 
\H\^K 2 2 2K2 - 2 \A\. 

A version of this result, with somewhat weaker bounds, will be a consequence of Propo- 
sition below (which is also due to Imre Ruzsa). 

Theorem 1 1 . 1 1 gives . in a weak sense, a complete description of sets with small doubling. 
We showed that if \A + A\ ^ K\A\ then A is contained in a coset of a subspace of size 
at most K 2 2 2K ~ 2 \A\; conversely, if A has this property then it is clear that \A + A\ ^ 
K 2 2 2K ~ 2 \A\. It would be of great interest to have a structure theorem which does not 
result in exponential losses in K of this sort. Perhaps one can even arrange things so 
that one has a result of the form 

doubling constant K ==>■ structure ==>■ doubling constant K', 
where K' is polynomial in K. 

It is easy to see that such a structure theorem would have to take a form somewhat 
different from Theorem 110.11 Indeed if one takes A C FJf to be a subspace H together 
with K points X\, . . . , Xk such that Span(a;i, . . . , %k) H H — {0} then it is clear that 
|v4 + v4| ^ -K'l^l, but that the smallest coset-of-a-subspace containing A has size roughly 
2 K \A\. 

Ruzsa [H| reports that Katalin Marton has suggested that one should be looking for a 
covering of A by a small number C\ (K) of cosets of some rather smaller subspace of size 
C^if) I agree with this, and it is to some extent believeable that C\{K) and C^-fT) 
can be polynomial in K. Ruzsa was probably the first to actually dare to conjecture 
this, and he certainly states such a conjecture explicitly in jjl]. Such matters are also 
touched upon (in the Z-setting) in [TH 

Imre Ruzsa indicated to me a large part of the following proposition giving a number 
of statements equivalent to such a structure theorem. The proof may be found in [30J. 

Proposition 10.2 (Ruzsa). The following five statements are equivalent. 

(1) If A C Wf has \A+ A\ K\A\, then there is A' C A, \A'\ ^ \A\/d(K), which 
is contained in a coset of some subspace of size at most C2{K)\A\. 

(2) If AC. F^° has \ A + A\ ^ i^|^4|, then A may be covered by at most C%{K) cosets 
of some subspace of size at most C±{K)\A\. 

(3) If A C F^° has \A + A\ ^ ^|^.| ; and if additionally there is a set B, \B\ ^ K, 
such that A + B = A + A, then A may be covered by at most C§(K) cosets of 
some subspace of size at most Cq(K)\A\. 

(4) Suppose that f : F^ — * F^° is a function with the property that \{f(x) + f(y) — 
f(x + y):x,yE F™}| ^ K . Then f may be written as g + h, where g is linear 
and\Im{h)\^C 7 (K). 
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(5) Suppose that f : F™ — ► F£° is a function with the property that for at least 
2 3m /K of the quadruples (xi, x 2 , x 3 , x 4 ) G F™ with x\ + x 2 = x 3 + x 4 we have 
f(xi) + f(x 2 ) = f(x 3 ) + /(x 4 ). Then there is an affine linear function g : F™ — > 
F^° such that f(x) = g(x) for at least 2 m /C%(K) values of x. 
Furthermore if Ci(K) is bounded by a polynomial in K for all i e I, where I is any of 
the sets {1, 2}, {3, 4}, {5, 6}, {7}, {8} then in fact Ci{K) is bounded by a polynomial in 
K for all i. 

Remarks. Statement (4) is perhaps the most elegant and natural one here. Observe also 
that (4) is rather easy with the bound C-j(K) = 2 K . Thus Proposition 110.21 implies a 
weak version of Theorem llO.il It is the possibility of polynomial bounds for Ci(K) that 
is the most interesting feature of this proposition. Let us call this the PFR conjecture: 

Conjecture 10.3 (Polynomial Freiman-Ruzsa conjecture for FJJ). The function Cj{K) 
[and hence all of the other functions Ci(K), i = 1, . . . , 8), can be taken to be polynomial 
in K. 

The following question has implications for PFR. 

Question 10.4. Let A C Fg be a set of density a. Then 2A — 2A contains a subspace 
with co dimension f(a). What is the behaviour of f{ct)l 

Using a Fourier-analytic technique of Bogolyubov [H] one may show that f(a) <C a~ 2 , 
and a refinement of this technique due to Chang allows one to improve this to 
f(a) <C a -1 log(l/a). We have not been able to rule out the possibility that f(a) <C 
log(l/a), which if true would imply PFR. 

The proof of Proposition 110.21 uses an important result known as Plunnecke's inequality 
|4~5] , a new proof of which was found by Ruzsa ■ This states that if A is a subset of any 
abelian group G, and if |yl + A| ^ if then we have the inequality \sA — tA\ ^ iC +t |A| 
for any positive integers s, t. The reader may observe that (1) of Proposition lTo~2l implies 
a much stronger bound for some large subset A' C A, for large s, t, at least if there is 
a good bound on C 2 {K). We may call such an A' subpliinnecke. Nets Katz asked me 
to formulate a principle to the effect that A being subpliinnecke implies that A is very 
economically contained in some coset of a subspace. The following result is my best 
effort so far in this direction: 

Proposition 10.5. Let A C F^°, and suppose that there is a constant B such that 
\tA\ ^ t B \A\ for all t ^ BlogB. Then A is contained in a union of 2 CBlogB cosets of 
some subspace having size at most \A\. 

The hope, of course, is that one might be able to show that if \A + A\ ^ K\A\ then A 
has a large subset A 1 which is subpliinnecke in the sense of Proposition 110.51 f° r some 
reasonably small B (ideally, B = O (log Kj log log K), which would imply PFR). 

For me the most important reason for wanting to understand the PFR conjecture is the 
implications it would have for our understanding of quadratic Fourier coefficients. In 
particular, PFR in Fg (the formulation is obvious) would imply a positive solution to 
the PGI Conjecture f Conjecture 16. 8|) . 

Proposition 10.6. Suppose that PFR is true in Fg. Then PGI is true. That is, let 
f : Fg — > [—1, 1] be a function with E/ = 0, and suppose that \\f\\u 3 ^ Then there is 
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a quadratic form q on such that 

\Ef(x)cu^ x) \ >5 C , 

for some absolute constant C. 
The deduction is given in [35] . 

In my opinion it would be very interesting to determine whether PGI has any implica- 
tions for PFR. It is just plausible that this represents the most natural way to attack 
PFR, though at the moment we have little idea how to carry out such a programme. 

The results of this section may be discussed in the context of general abelian groups G. 
However, the issues are of a rather different nature to those discussed in $HJ Freiman's 
original work concerned subsets of Z, and was quite geometric in feel. See 0113 E2j for 
a further discussion. Ruzsa's proof |lHj has proved much more adaptable, and recently 
Ruzsa and the author [32J were able to obtain a structure theorem for sets with small 
doubling which is valid in any abelian group. 

Theorem 10.7 (G. - Ruzsa). Let G be an abelian group, and suppose that A C G 
has \A + A\ ^ -K"|^4|- The A is contained in a set of the form H + P, where H is a 
subgroup, P is a generalised arithmetic progression, the dimension of P is ^ Cg(K) and 
\H\\P\<:C W (K). 

Remark. A generalised arithmetic progression of dimension d is a set of the form 

{a + Aiai H h X d a d : ^ Aj ^ Lj for i = 1, . . . , d.} 

We obtain the bounds Cg(K) <C K c and C\o(K) <C e K ° , for some absolute constant 
C. 
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